12. Reinforcement Learning

For an example of reinforcement learning gone awry, read about Microsoft's "Tay" Twitter bot in this article.

  • Human in the Loop (HITL) refers to having a human moderator or data annotator who can help with quality control of a product.
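
One way to picture a human in the loop is a system that acts on its own only when it is confident and hands everything else to a moderator. The sketch below is illustrative only: the `ReviewQueue` class, its `route` method, the label strings, and the 0.8 threshold are all hypothetical names and values chosen for this example, not part of any particular product or library.

```python
from dataclasses import dataclass, field


@dataclass
class ReviewQueue:
    """Hypothetical queue of items waiting for a human moderator."""

    threshold: float = 0.8  # assumed cutoff; a real system would tune this
    pending: list = field(default_factory=list)

    def route(self, item: str, label: str, confidence: float) -> str:
        """Accept confident predictions; defer the rest to a human."""
        if confidence >= self.threshold:
            return label
        # Low confidence: hold the item until a moderator has looked at it.
        self.pending.append(item)
        return "needs_human_review"


queue = ReviewQueue(threshold=0.8)
auto = queue.route("hello there", "ok", 0.95)        # confident, accepted as-is
held = queue.route("ambiguous message", "ok", 0.55)  # uncertain, sent to a human
```

In this sketch the moderator works through `queue.pending`; their decisions could then serve as annotated data for improving the system.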